OPTIMAL ERROR ESTIMATES 
OF GALERKIN FINITE ELEMENT METHODS 
FOR STOCHASTIC PARTIAL DIFFERENTIAL EQUATIONS 
WITH MULTIPLICATIVE NOISE 



RAPHAEL KRUSE* 

Abstract. We consider Galcrkin finite clement methods for semilinear sto- 
chastic partial differential equations (SPDEs) with multiplicative noise and 
Lipschitz continuous nonlinearities. We analyze the strong error of conver- 
gence for spatially semidiscrete approximations as well as a spatio-temporal 
discretization which is based on a linear implicit Eulcr-Maruyama method. In 
both cases we obtain optimal error estimates. 

The proofs are based on sharp integral versions of well-known error es- 
timates for the corresponding deterministic linear homogeneous equation to- 
gether with optimal regularity results for the mild solution of the SPDE. The 
results hold for different Galcrkin methods such as the standard finite element 
method or spectral Galerkin approximations. 



1. Introduction 

This article is devoted to the analysis of numerical schemes for the discretization 
of stochastic partial differential equations (SPDEs) with multiplicative noise. For 
the last 15 years this has been an very active field of research. An extensive list of 
references can be found in the review article [18]. 

Here we apply the well-established theory of Galerkin finite element methods 
from [27] and, in combination with optimal spatial and temporal regularity results 
[20, 22], we derive optimal error estimates for spatially semidiscrete as well as for 
spatio-temporal approximation schemes. Our analysis is suitable to treat different 
Galerkin methods such as the finite element method or spectral Galerkin methods 
in a unified setting. 

We begin with a probability space (ft, J 7 , P) together with a normal filtration 
(J"t) t6 [ 0j T] C T . By W: [0,T] x D, —> U we denote an adapted Q- Wiener process 
with values in a separable Hilbert space (U, (•, -)u, \\ • \\u)- The covariancc operator 
Q : U — > U is assumed to be linear, bounded, self-adjoint and positive semidefinite. 

Further, let (H, (•, •), || • |j) be another separable Hilbert space and A: D(A) C 
H — > H a linear operator, which is densely defined, self-adjoint, positive definite, 
not necessarily bounded but with compact inverse. Hence, there exists an increasing 
sequence of real numbers (A„)„>i and an orthonormal basis of eigenvectors (e„)„>i 
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in H such that Ae n = A„e n and 

< Ai < A 2 < . . . < A n (-> oo). 
The domain of A is characterized by 



oo 

D(A) = {x G iJ : A^(x,e„) 2 < oo}. 



n=l 



Thus, —^4 is the generator of an analytic semigroup of contractions which is denoted 
by E{t) 



e 



As our main example we have the following in mind: H is the space L (T>), where 
T> C M d is a bounded domain with smooth boundary dT> or a convex domain with 
polygonal boundary. Then, for example, let — A be the Laplacian with homogeneous 
Dirichlet boundary conditions. 

Next, we introduce the stochastic process, which we want to approximate. Let 
X: [0,T] x fi ->• H, T > 0, denote the mild solution [7, Ch. 7] to the stochastic 
partial differential equation 

dX(t) + \AX(t) + f(X(t))} dt = g(X(t)) dW(t), for < t < T, 
(L1) ' X(0) = *„. 

Here / and g denote nonlinear operators which are Lipschitz continuous in an 
appropriate sense. In Section 2 we give a precise formulation of our conditions on 
/, g and Xq, which are also sufficient for the existence and uniqueness of X . 
By definition the mild solution satisfies 

(1.2) X{t) = E(t)X - f E(t- a)f(X(a)) da + f E(t - a)g(X(a)) dW(a) 

Jo Jo 

for all < t < T. 

Our aim is to analyze numerical schemes which are used to approximate the 
solution X. For an implementation one needs to discretize the time interval [0,T] 
as well as the Hilbert spaces H and U, since both are potentially high or infinite 
dimensional. 

In this paper we deal with the discretization of the time interval [0, T] and the 
Hilbert space H . A fully discrete approximation of the mild solution X, which also 
includes the discretization of the space U , will be done in a forthcoming paper. 

Our first result is concerned with a so called spatially scmidiscrcte approximation 
of (1.2), that is, we only discretize with respect to the Hilbert space H. 

By (Sh)he(o,i] we denote a family of finite dimensional subspaces of H consisting 
of spatially regular functions. In our example with H = L 2 (T>), Sh may be a 
standard finite element space or the linear span of finitely many eigenfunctions of 
A (see Examples 3.3 and 3.4). 

Let the stochastic process X^: [0,T] x S! — > Sh solve the stochastic evolution 
equation 

dX h (t) + [A h X h (t) + P h f(X h (t))] dt = P h g{X h (t)) dW(t), for < t < T, 

X h (0) = P h X , 



where Ph denotes the (generalized) orthogonal projector onto Sh and Ah : Sh — > Sh 
is a discrete version of the operator A which will be defined in (3.4). 
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As for the continuous problem (1.1) there exists a unique mild solution Xh to 
equation (1.3) which satisfies 

(1.4) X h (t) = E h (t)P h X - / E h {t - a)P h f(X h (a)) da 

Jo 

+ f E h (t - a)P h g(X h (a)) dW(a) for < t < T. 
Jo 

This paper deals with the strong error of convergence. Here, strong convergence 
is understood in the sense of convergence with respect to the norm 

\\Z\\ LP{n , H) = (E[\\Z\\ P ])K p>2, 

where E is the expectation with respect to P. Therefore, strong convergence indi- 
cates a good pathwise approximation. 

In many applications the aim is to approximate the expectation E[y(X(T))], 
where tp is a smooth observable. This leads to the concept of weak convergence 
which, in the context of SPDEs, is considered in [9, 8, 11, 15]. However, as it 
was shown by Giles [12, 13], the strong order of convergence is also essential for 
developing efficient multilevel Monte Carlo methods for applications where the weak 
approximation is of interest. 

Before we formulate our first main result let us explain two of the most crucial 
parameters. First, we have the parameter r G [0, 1) which controls the spatial 
regularity of the mild solution X. On the other hand, the parameter h <G (0, 1] 
governs the granularity of the spatial approximation. In our example with H = 
L 2 (T>) and under the given assumptions, the mild solution X(t) maps into the 
fractional Sobolev space Hq(T>) n H 1+r (T>) and h denotes the maximal length of 
an edge in a partition of T> into simplices. 

Theorem 1.1. Under the assumptions of Section 2 with r € [0,1), p € [2, oo), and 
Assumption 3.1 there exists a constant C, independent of h € (0, 1], such that 

\\X h (t)-X(t)\\ LP(n:H) < Ch 1+r , for all t€ {0,7], 

where Xh and X denote the mild solutions to (1.3) and (1.1), respectively. 

Therefore, in our example of a standard finite element semidiscretization, the 
approximation Xh converges with order 1 + r to the mild solution X. Since this 
rate coincides with the spatial regularity of X it is called optimal (see [27, Ch. 1]). 

We stress that, to the best of our knowledge, in all articles which deal with the 
numerical approximation of semilinear SPDEs the obtained order of convergence is 
of the suboptimal form 1 + 1 — e for any e > (see [29] or [14], where also stronger 
Lipschitz assumptions have been imposed on /, g) or the error estimates contain a 
logarithmic term of the form \og(t/h) as in [21]. 

Next, we consider a spatio-temporal discretization of the stochastic partial dif- 
ferential equation (1.1). Let k £ (0, 1] denote a fixed time step which defines a time 
grid tj = jk, ] = 0, 1, . . . , N k , with N k k<T < (N k + l)k. 

Further, by X J h we denote the approximation of the mild solution X to (1.2) at 
time tj. A combination of the Galerkin methods together with a linear implicit 
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Euler-Maruyama scheme results in the recursion 
(1.5) 

X{ - Xj- 1 + k(A h X{ + P h f{Xl 1 )) = P h g(xt 1 )AW j , for j = 1, ... , N k , 

with the Wiener increments AW 3 := W(tj) — W(tj—i) which are Tu -adapted, 
U- valued random variables. Consequently, X 3 h is an !F tj -adapted random variable 
which takes values in Sh- 

Our second main result is the analogue of Theorem 1.1. 

Theorem 1.2. Under the assumptions of Section 2 with r G [0,1), p € [2, oo), and 
Assumptions 3.1 and 3.2 there exists a constant C, independent of k,h € (0,1], 
such that 

\\Xi-X(t j )\\ LHQ . H) <C(h 1+r + ki), for allj = l,...,N k , 

where X denotes the mild solution (1.2) to (1.1) and X 3 h is given by (1.5). 

As above, we obtain the optimal order of convergence with respect to the spatio- 
temporal discretization. Since we only use the information of the driving Wiener 
process which is provided by the increments AM-^, it is a well-known fact [5] that 
the maximum order of convergence of the implicit Euler scheme is 4 . It is possible 
to overcome this barrier if one considers a Milstcin-like scheme as discussed in the 
recent paper [19]. 

In our error analysis we use the results from [27, Ch. 2 and 3] which are stated 
under the assumption that —A is the Laplace operator with homogeneous Dirichlet 
boundary conditions. But all proofs and techniques also hold in our more general 
framework of a sclf-adjoint, positive definite operator A. Further, we use generic 
constants which may vary at each appearance but are always independent of the 
discretization parameters h and k. 

The structure of this paper is as follows: In the next section we introduce some 
additional notations and formulate the assumptions on /, g and Xq which will be 
sufficient for the existence of the unique mild solution X to (1.1) as well as for the 
proofs of Theorems 1.1 and 1.2. In Section 3 we give a short review of Galerkin 
finite element methods and we also introduce two additional assumptions on the 
choice of the family of subspaces (Sh)h£(o.i}- As already mentioned, Section 3 also 
contains two concrete examples of a spatial discretization. 

In Section 4 we present several lemmas which play a crucial role in the proofs 
of our main results. All lemmas are concerned with the spatially semidiscrete and 
fully discrete approximation of the deterministic homogeneous equation (4.1). We 
prove extensions of well-known convergence results from [27] to non-smooth initial 
data as well as sharp integral versions. 

While Sections 5 and 6 are devoted to the proofs of Theorems 1.1 and 1.2, 
respectively, the final section revisits the special case of SPDEs with additive noise. 

2. Preliminaries 

In order to formulate our assumptions on /, g and Xq we introduce the notion 
of fractional powers of the linear operator A in the same way as in [26, 22]. After 
fixing some additional notation we give a precise formulation of our assumptions 
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and cite the result on the existence of a unique mild solution to (1.1) from [20] and 
a corresponding regularity result from [22] . 

For any r G R the operator A? : D(A?) — > H is given by 

oo 

A*x = 2J \nX n e n 

n=l 

for all 

oo oo 

x G D(A%) = ix = x n e n : {x n ) n >i C R with ||a;||^ := ^ K x n < °°}- 

n— 1 n— 1 

By defining H r := D(A5) together with the norm j| • || r for rel, H r becomes a 
Hilbert space. Note that we have |jx|j r = ||A5.t|| for all r <E R and x € H r . 

As usual [7, 25] we introduce the separable Hilbert space Uq := Q^{U) with 
the inner product (uq,Vo)u := (Q ~^ u 0i Q ~^ v o)u- Here denotes the pseu- 

doinverse. Further, the set L\ denotes the space of all Hilbert-Schmidt operators 
4> : ?7o — > H with norm 

oo 

||$||£o := E ll^-l! 2 ' 

m— 1 

where (ip m ) m >i is an arbitrary orthonormal basis of Uq (for details see, for example, 
Proposition 2.3.4 in [25]). We also introduce the subset L\ r C r > 0, which is 
the subspace of all Hilbert-Schmidt operators $ : Uq — > £T r together with the norm 
j|$|| L o r := ||Ai$|| L o. 

Let r G [0, 1], p G [2, oo) be given. As in [20, 22, 26] we impose the following 
conditions on /, g and Xq. 

Assumption 2.1. The nonlinear operator f maps H into H~ 1+r and there exists 
a constant C such that 

\\f{x) - /(y)||_i +r < C||x-y|| for all x,y G H. 

Assumption 2.2. The nonlinear operator g maps H into L\ and there exists a 
constant C such that 

(2-1) \\g{x)-g{y)\\ L o<C\\x-y\\ Wx,y G H. 

Furthermore, we have that g(H r ) C 1^ and 

(2.2) \\g{x)\\ L » r <C {l + \\x\\ r ) forallx£H r . 

Assumption 2.3. The initial value Xo : ft — > H r+1 is an To-measurable random 
variable with E [H-X'ollr+i] < °o. 

Under these conditions, by [20, Theorem 1], there exists an up to modification 
unique mild solution X : [0, T] x Q, — > H to (1.1) of the form (1.2). Furthermore, by 
the regularity results in [22], it holds true that for all s G [0, r + 1] with r G [0, 1), 
we have 

(2-3) sup Epr(t)||?] <oo 

te[o,T] 

and there exists a constant C such that 

(2.4) (V[\\X(h) - X(t 2 W s ])» ^Clh-hr^'^) 

for all h,t 2 £ [0,T]. 
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Remarks. 1.) Of course, Assumption 2.1 is satisfied under the usual condition 
that the mapping / is Lipschitz continuous from H to H. The reason for our 
slightly weaker assumption is that under this condition the order of the spatial 
discretization error will numerically behave in the same way for both integrals, the 
Lebesgue integral which contains / and the stochastic integral which contains g. 

In addition, Assumption 2.1 applies to partial differential equations where a 
fractional power of the operator A is situated in front of a Lipschitz continuous 
mapping /: H — > H as, for example, in the Cahn-Hilliard equation. 

2.) Assumption 2.3 can be relaxed to Xq: Vl — > H being an .Fo-measurable 
random variable with E[||Ao|| p ] < oo. But, as with deterministic PDEs, this will 
lead to a singularity at t = in the error estimates. 

We complete this section by collecting useful facts on the semigroup E(t). The 
smoothing property (2.5) and Lemma 2.4 (ii) arc classical results and proofs can 
be found in [24]. A proof for the remaining assertions is given in [22]. 

Lemma 2.4. For the analytic semigroup E(t) the following properties hold true: 

(i) For any v > there exists a constant C = C(y) such that 

(2.5) \\A v E{t)\\ < Cr v for all t > 0. 

(ii) For any < v < 1 there exists a constant C = C{v) such that 

\\A-"(E(t) - I)\\ < Ct v for all t > 0. 
(Hi) For any < v < 1 there exists a constant C = C{y) such that 

/ \\A^E(t 2 - a)x\\ 2 da < C(r 2 - n) 1 ^ \\x\\ 2 for all x e H, < r x < r 2 . 

Jti 

(iv) For any < v < 1 there exists a constant C = C(y) such that 

/•T2 

A v \ E(T 2 ~o-)xdcr <C{t 2 -ti) 1 ~ v \\x\\ for allx&H,Q<Ti<T 2 . 

3. A SHORT REVIEW OF GALERKIN FINITE ELEMENT METHODS 

In this section we first review the the Galerkin finite element methods used for 
the discretization of the Hilbert space H. Following [27, Ch. 2 and 3] we recall the 
definition of several discrete operators which are connected to a sequence of finite 
dimensional subspaces of H 1 . We close this section with two concrete examples, 
namely the standard finite' clement method and a spectral Galerkin method. 

Let (S , / l ) ftg ( be a sequence of finite dimensional subspaces of H 1 and denote 
by Rh- H 1 — s- Sh the orthogonal projector (or Ritz projector) onto Sh with respect 
to the inner product 

a(x,y) := (A^x, A^y), ioi x,y ^ H 1 . 

Thus, we have 

a(R h x,y h ) = a(x,y h ) for all x G H 1 , y h G S h . 

Throughout this paper we assume that the spaces (Sh)he(o,i] > satisfy the follow- 
ing approximation property. Below we present two examples of A 7 H and spaces 
(Sh)h£(o,i] which fulfill this assumption. 
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Assumption 3.1. Let a sequence [Sh)he(o,i] of finite dimensional subspaces of H 1 
be given such that there exists a constant C with 

(3.1) \\R h x-x\\ < C7i s ||z|| s for all x £ H" , s £ {1,2}, h £ (0,1]. 

Remark. Following [23, Ch. 5.2] or [27, Ch. 1] consider the linear (elliptic) problem 
to find x € D(A) = H 2 such that Ax = z holds for a given z £ H . The weak or 
variational formulation of this problem is: Find x £ H l which satisfies 

(3.2) a(x,y) = (z,y) for all y £ H 1 . 

For a given sequence of finite dimensional subspaces (Sh)he(o,i] the Galerkin ap- 
proximation Xh £ Sh of the weak solution x is given by the relationship 

(3.3) a{x h ,y h ) = (z,y h ) for all yu £ Sh- 

Note that by the representation theorem x £ H 1 and Xh £ Sh are uniquely deter- 
mined by (3.2) and (3.3). By the definition of the Ritz projector and since (3.2) 
holds for all yh £ Sh we get 

a(R h x,y h ) = a{x,y h ) = a{x h ,y h ) for all y h £ Sh- 

This yields RhX = Xh, that is, the Ritz projection RhX coincides with the Galerkin 
approximation of the solution x to the elliptic problem. Hence, Assumption 3.1 is 
a statement about the order of convergence of the sequence {xh)he(o,i} 1,0 x - 

Next, we introduce the mapping Ah : Sh — > Sh, which is a discrete version of the 
operator A. For x^ £ Sh we define AhXh to be the unique element in Sh which 
satisfies the relationship 

(3.4) a(x h ,yh) = (A h x h ,yh) for all y h £ S h . 
Since 

(A h x h ,yh) = a{x h ,Vh) = {xh,A h yh) for all x h ,Vh G Sh, 

as well as 

(A h x h ,x h ) = a(x h ,x h ) = \\xh\\l > for all x h £ Sh, x h ^ 0, 

the operator Ah is self- adjoint and positive definite on Sh- Hence, — Ah is the 
generator of an analytic semigroup of contractions on Sh, which is denoted by 
Eh(t) = e~ Aht . Let p > 0. Similar to [27, Lemma 3.9] one shows the smoothing 
property 

(3.5) \\A p h Eh(t)y h \\ < Ct-P \\y h \\ for all t > 0, 

where C = C(p) is independent of h £ (0, 1]. Additionally, by the definition of Ah, 
it holds true that 

(3.6) WAlijhf = a{y h ,y h ) = \\A^y h \\ 2 = \\y h \\i for all y h £ S h C H 1 . 

Finally, let Ph : H^ 1 Sh be the generalized orthogonal projector onto Sh (see 
also [4]) defined by 

(PhX, yh) = (x, yh) for all x £ H' 1 , y h £ Sh, 

where (-,-) = a(^4 -1 -,-) denotes the duality pairing between H^ 1 and H . By 
the representation theorem, Ph is well-defined and, when restricted to H , coincides 
with the standard orthogonal projector with respect to the if-inner product. 
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These operators are related as follows: 
(3.7) K lp h x = RhA^x for all x £ H' 1 

since 

a(R h A~ 1 x, y h ) = a(A _1 a;,2/fc) = (x,y h ) = (PhX,yh) = P h x,y h ) 
for all x € H~ l , yh & Sh- Furthermore, it holds that 

, M -ip I, \{Al h P h x,z h )\ \{P h x,A-h h )\ 

\\A h 2 P h x\\= SUp J -r— !■= sup J " 

^ 38 ^ z h es h \\Zh\\ z h es h \\Zh\\ 

\( x > z 'h)\ ^ mm NI-iKlli 



sup J — ; < sup j — 2 — = j I a- j I — i , 

^7' II 
/i~/ill 



for all x E H 1 , where the last equality is due to (3.6). Having established this we 
also prove the following consequence of (3.5) 

(3.9) \\E h (t)P h x\\ = \\AlE h {t)Al h - P h x\\ < cH \\ A~ h P h x\\ < cH^H^ 

for all x 6 H~ x , t > and h e (0, 1]. 

The following assumption, which is concerned with the stability of the projector 
Ph with respect to the norm || • |i, will mainly be needed for the proof of Lemma 
4.4 (ii) below and, consequently, also for the proof of Theorem 1.2. 

Assumption 3.2. Let a family (Sh)h£(o,i] of finite dimensional subspaces of H 1 
be given such that there exists a constant C with 

(3.10) \\Phx\\i < C\\x\\i for all x e H 1 , he (0,1]. 

We conclude this section with two examples which satisfy Assumptions 3.1 and 
3.2. 

Example 3.3 (Standard finite element method). Assume that H = L 2 (T>), where 
D C l d , (i = 1,2, 3, is a bounded, convex domain (a polygon if d = 2 or a polyhedron 
if d = 3). Let the operator A be given by Au = —V • (a(x)\ 7 u)+c(x)u with c(x) > 
and a(x) > ao > for with Dirichlet boundary conditions. In this case it is 

well-known (for example, [23, Theorem 6.4] and [27, Ch. 3]) that H 1 = Hq(T>) and 
H 2 = H 2 (V) n HKV), where H k (V), k > 0, denotes the Sobolev space of order 
k and Hq(T>) consists of all functions in i? 1 (D) which are zero on the boundary. 
Furthermore, the norms in H k {T>) and H k are equivalent in H k for k = 1,2 (see 
[27, Lemma 3.1]). 

Let (Th)he(o,i] denote a regular family of partitions of T> into simpliccs, where h 
is the maximal meshsizc. We define Sh to be the space of all continuous functions 
yh : T> — > R, which are piecewise linear on Th and zero on the boundary dT>. Then 
Sh C H 1 and Assumption 3.1 holds by [27, Lemma 1.1] or [1, Theorem 5.4.8]. 

Further, if the family (7~h)h£(o,i] is quasi-uniform then also Assumption 3.2 is 
satisfied. But for a more detailed discussion of Assumption 3.2 in the context of 
the finite element method we refer to [2, 3, 6]. 



Example 3.4 (Spectral Galerkin method). In the same situation as in Example 
3.3 we further assume that T> = (0, 1) C R and —A is the Laplace operator with 
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homogeneous Dirichlet boundary conditions. In this situation the orthonormal 
cigcnfunctions and eigenvalues of the Laplace operator are explicitly known to be 

A fc = k 2 n 2 and e k {y) = V2sin(fc7ry) for all k € N, k > 1, y £ V. 

_ i 

For N e N set h := and define 

Sh ■= span{e fc : k = 1 , . . . , N}. 

Note that Sh C H r for every rgl For x e H 1 we represent the Ritz projection 
RhX <E Sh in terms of the basis (ek)^ =1 - This yields RhX = J2 k =i x k e ki where the 
coefficients (x^)^ =1 are given by 

x\ = (R h x,e k ) = ^-(R h x,Ae k ) = -^-a(R h x 7 e k ) = -^-a(x,e k ) = (x,e k ). 
Afc Afe Afc 

Hence, in this example, the Ritz projector Rh is the restriction of the orthogonal 
L 2 -projector Ph to H 1 . Moreover, we have 



\\(I - R h )x\\= - P h )x\\= J2 (^ e fe) 2 = E V(^ f efc) 2 

k=N+l k=N+l 

oo 

^ V+i £ ( A > e fe) 2 = h2p M\l fOT aU x e H", p = 1, 2, 

fc=Ar+l 

since A^ 1 < A^y 1 ^ = h 2 for all k > N + 1. Therefore, Assumption 3.1 is satisfied 
for the spectral Galerkin method. 

That Assumption 3.2 holds is easily seen by 



\Phx\\l 



N 2 N 

e fc )e fc = (A^x, e fe ) 2 < Hxj^ for all x e if 1 . 



fe=i fe=i 
A detailed presentation of spectral Galerkin methods is found in [17]. 

4. Error estimates of Galerkin methods for deterministic equations 

This section extends error estimates from [27] for the discretization of the deter- 
ministic linear homogeneous equation 

(4.1) di"^ + Au ^ = °' 1 > °' with = X ' 

to non-smooth initial data x G H . We will also present suitable integral version 
of these estimates which are crucial for the derivation of the optimal error estimates. 

4.1. Error estimates for a spatially semidiscrete approximation. The fol- 
lowing two lemmas provide some useful estimates of the operator Fh , which is given 
by F h {t) := E h {t)Ph - E(t), t > 0. Note that F h {t)x can be seen as the error at 
time t > between the weak solution u to (4.1) and Uh which solves the spatially 
semidiscrete equation 

4ru h (t) + A h u h (t) = 0, t > 0, with u h (0) = P h x, 

for x e i? -1 . 
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Lemma 4.1. Under the Assumption 3.1 the following estimates hold for the error 
operator Fh ■ 

(i) Let < v < \i < 2. Then there exists a constant C such that 

\\F h (t)x\\ < ChH-^WxWv for all x G H v , t> 0, he (0,1]. 

(ii) Let < p < 1. Then there exists a constant C such that 

\\F h (t)x\\ < Ct-%\\x\\- p for all x e £r p , t > 0, he (0,1]. 

(m,) Let < p < 1. Then there exists a constant C such that 

\\F h (t)x\\ < Ch 2 - p t- l \\x\\- p for all x e H~ p , t > 0, h G (0,1]. 

Proof. The proof of estimate ^ can be found in [27, Theorem 3.5]. 

In order to prove (ii) we first note that the case p = is true by (i). Lemma 2.4 
(i) yields 

(4.2) \\E{t)x\\ = \\AlE(t)A-ix\\ < C*-4||a:||_i. 

Together with (3.9) this proves 

II^W^II < \\E h (t)P h x\\ + \\E(t)x\\ < Ct-^WxW-! 

for all x e H^ 1 . This settles the case p = 1. The intermediate cases follow by the 
interpolation technique which is demonstrated in the proof of [27, Theorem 3.5]. 

For (Hi) the case p = is again covered by (i). Thus, it is enough to consider 
the case p = 1. First, by using (3.7), (3.5), and (3.1), we observe that 

\\F h (t)x\\ = \\A h E h {t)A- l P h x- AE{t)A^x\\ 

< WAhEhitjPhiRhA^x - A^xMl + \\{A h E h {t)P h - AE^A^xW 



< Ct- 1 \\(R h -I)A- 1 x\ 



dF h 
dt 



(^A^x 



< Ct^hllA-h 



dt 



{t)A- l x 



Since ||A 1 x||i = ||.t||_i the first term is already in the desired form. The last term 
is estimated by a slightly modified version of [27, Theorem 3.4], which gives 



dF, 
~~dt 



-(^A^x 



This proves the case p = 1 and the intermediate cases follow by interpolation. □ 

Lemma 4.2. Let < p < 1. Under Assumption 3.1 the operator Fh satisfies the 
following estimates. 

(i) There exists a constant C such that 

[ F h {o-)xda <Ch 2 - p \\x\\- p for all x e H~ p , t > 0, he (0,1]. 
Jo 

(ii) There exists a constant C such that 



F h (o-)x\\ 2 da) 2 < Ch 1+P \\x\\ p for allxeH p , t > 0, he (0,1]. 
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Proof. As in the proof of the previous lemma it is enough to show the estimates 
for p = and p = 1. Then the intermediate cases follow by interpolation. 

The proof of (i) with p = is contained in the proof of [27, Theorem 3.3], where 
the notation 



F h (a)x do- 



it 



is used. 

Here we present a proof of (i) with p = 1. To this end we use (3.7) and find the 
estimate 

ft 



F h {a)xda 



o 



< 



/ (A h E h (a)AfP h - AE(a) A- 1 ) x Ad- 
Jo 

I A h E h {a)P h {R h -l)A- 1 xda 
Jo 

f (A h E h (a)P h - AE(a)) A- 1 x do- 
Jo 

f ^{a)P h {R h -l)A- 1 xda + f ^(a)A~ 1 xda 
Jo da Jq da 



By the fundamental theorem of calculus, ||-P/i2/|| < \\y\\ for all y G H, and Assump- 
tion 3.1 we have for the first term 

J ^(a)P h (R h -l)A- 1 xda\\ = \\(E h {t) - I) P h (R h - I) A' 1 x\\ 

< ChWA^x^ = C7i||a;||-i. 
For the second term we use Lemma 4.1 (i) with p = v = 1. This yields 



J^^(a)A- 1 xda\\ = \\(F h (t) - F^A^xl 



< WFhitfA^xW + ||(7 -P^A^xW < ChWxW-!. 

In the last step we used the best approximation property of the orthogonal projector 
Ph, which, together with (3.1), gives 

\\(Ph - I)y\\ < \\(Rh - I)y\\ < Ch\\y\\i for all y e H\ 
It remains to prove (ii). From [27, (2.28)] we have the inequality 

\ \\F h (a)x\\ 2 da < / \\(R h -I)E{a) x\\ 2 da. 
Jo Jo 

In both cases, p <E {0, 1}, we have by (3.1) 



\(R h -I)E{o)x\\ < Ch 1+p \\E{a) 



\i+p 



Ch 1+p \\A^E{a)A%: 



Applying Lemma 2.4 (Hi) with v = 1 completes the proof. 



□ 



4.2. Error estimates for a fully discrete approximation. In this subsection 
we consider a fully discrete approximation of the homogeneous equation (4.1). Our 
method of choice is a combination of the spatial Galcrkin discretization together 
with the well-known implicit Euler scheme. As in Subsection 4.1 let a family of 
subspaces (S/i)fte(0,i] C H 1 be given. The fully discrete approximation scheme is 
defined by the recursion 



(4.3) 



U J h + kA h Ui = Ul-\ j = 1,2,... with U% = P h x. 
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Here k £ (0, 1] denotes a fixed time step and U 3 h £ Sh denotes the approximation 
of u{tj) at time tj = jk. A closed form representation of (4.3) is given by 

(4.4) U{ = (I + kA h )-ip h x, j = 0,1,2,.... 

In order to make the results from [27, Ch. 7] accessible and to indicate generaliza- 
tions to other onestep methods onestep methods we introduce the rational function 

R(z) = — !— for z e M, z + 1. 

w 1 + z ' r 

By R(kAh) we denote the linear operator which is denned by 

N h 

(4.5) R{kA h )x = R(k\ h , m )(x 

.in 5 

m— 1 

where {^h,m)m=i are ^ ne positive eigenvalues of Ah : Sh — > Sh with corresponding 
orthonormal eigenvectors {iph,m) m =i c an d dim(5h) = Nh- With this notation 
(4.3) is equivalently written as 

(4.6) XJ{ = R(kA h yP h x, j = 0, 1, 2, . . . . 

The characteristic function R of the implicit Euler scheme enjoys the following 
properties with q = 1: 

R(z) = e~ z + 0(z q+1 ) forz^O, 
< ~ 4 ' 7 ' ) \R(z)\ < 1 for all z > 0, and lim R(z) = 0. 

z— >oo 

In the nomenclature of [27, Ch. 7] the rational function R(z) is an approximation 
of e~ z with accuracy q = 1 and is said to be of type IV. A onestep scheme, whose 
characteristic rational function possesses the properties (4.7), is unconditionally 
stable and satisfies, for p £ [0, 1], the discrete smoothing property 

(4.8) \\A p h R(kA h y x h \\ < CtT'Haifcll for all j = 1,2,... and x h £ S h , 

where the constant C = C(p) is independent of k,h and j. For a proof of (4.8) we 
refer to [27, Lemma 7.3]. 

Further, as in the proof of [27, Theorem 7.1] it follows from (4.7) that there 
exists a constant C with 

(4.9) \R(z) - e~ z \ < Cz q+1 for all z £ [0, 1] 
and there exists a constant c £ (0,1) with 

(4.10) \R(z)\ < e~ cz for all z £ [0, 1]. 

The rest of this subsection deals with estimates of the error between the discrete 
approximation U ] h and the solution u(tj). For the error analysis in Section 6 it will 
be convenient to introduce an error operator 

F kh {t) :=E kh {t)P h -E(t), 

which is defined for all t > 0, where 

(4.11) E kh {t) :=R(kA h y, if t € [tj-utj) for j = l,2,... . 

The mapping 1 1— > Ekh{t), and hence 1 1— > F k h{t), is right continuous with left limits. 
A simple consequence of (4.8) and (3.8) is the inequality 

(4.12) \\E kh (t)P h x\\ = \\AlR{kA h y Al h P h x\\ < C*^* ||ar|| —1 < Ct~i \\x\\_ v 
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which holds for all x £ h,k e (0, 1] and t > with t £ [tj-i,tj), j = l,2,.. .. 

The following lemma is the time discrete analogue of Lemma 4.1. 

Lemma 4.3. Under Assumption 3.1 the following estimates hold for the error 
operator F k h ■ 

(i) Let < v < p < 2. Then there exists a constant C such that 
\\F kh (t)x\\ < C(h^ + k^)t~ B ^\\x\\ v for all x £ H", t > 0, h, k £ (0,1]. 

(ii) Let < p < 1. Then there exists a constant C such that 

\\F kh (t)x\\ < C<"^||a;||_ p for all x G H~ p , t > 0, h,k £ (0, 1]. 
(Hi) Let < p < 1. T/ien i/iere exists a constant C such that 
\\F kh {t)x\\ < C(/i 2 " p + fc 2 ^ £ )t _1 ||a;||_ p for all x G t > 0, h,k <E (0,1]. 

Proof, (i) Let i > be such that < t < tj and ir € i? 1 '. Then we get 

||-F«.(t)s|| < || (fl(fci4fc) J 'i^-S(* 3 -))a;|| + || 
For the second summand we have by Lemma 2.4 (i) and (zzj 

||(-B(tj) a;| | = \\A-^(E(tj -t) - l)A^ E{t)A% x\\ 

< C{tj - t^r 1 ^ \A%x\ < Ck^r^ \x\ v . 

Further, the first summand is the error between the exact solution u of (4.1) and 
the fully discrete scheme (4.6) at time tj. For the case p, = v = 2, [27, Theorem 
7.8] gives the estimate 

\\(R{kA h y P h - E(tj))x\\ < C(h 2 + k)\\x\\ r 

By the stability of the numerical scheme, that is (4.8) with p = 0, we also have the 
case /i = v = 0. Hence, 

(4.13) \\F kh (tj)x\\ < C\\x\\, 

and, as above, the constant C is independent of h, k G (0, 1], tj > 0, and x. The 
same interpolation technique, which is used in the proof of [27, Theorem 7.8], gives 
us the intermediate cases for p — v and p G [0, 2], that is 

(4.14) || (R(kA h yP h - E(tj))x\\ < C(h" + k%)\\x\\ . 

On the other hand, [27, Theorem 7.7] proves the case v = and p = 2. Hence, we 
have 

\\(R(kA h ) j P h -E(tj))x\\ < C(h 2 + k)t^\\x\l 

where the constant C is again independent of h, k € (0, 1], tj > 0, and x G H . An 
interpolation between this estimate and (4.13) shows 

(4.15) \\(R(kA h yP h - E(tj))x\\ < Cfif + k^tJ* \\x\\ 

for all p £ [0, 2]. For fixed p G [0, 2] the proof of (i) is completed by an additional 
interpolation with respect to v G [0,/^] between (4.14) and (4.15) and the fact that 

The proof of fiij works analogously. The case p = is true by (4.13) and the 
case p = 1 follows by (4.2) and (4.12), since 

||*fch(*H| < \\E kh (t)P h x\\ + \\E(t)x\\ < cH||x|| . 
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The intermediate cases follow by interpolation. 

For (Hi) the case p = is already included in (i) with p, = 2 and v = 0. Thus, 
it remains to show the case p = 1. For t > with tj^i < t < tj we have 

||*fch(tHI < ||(i2(fc4h) j '-£fc(ti))ite|| + iK^fe)^-^-))^!! 

+ || (E(tj) - E(t))x\\ =: Ti + T 2 + T 3 . 

As in (4.5) we denote by (Xh,m)m=i the positive eigenvalues of Ah with cor- 
responding orthonormal eigenvectors (</>fc,m.) m Li C For T\ we use the expansion 
of PhX in terms of (y/i,m) m =i- This yields 



m— 1 

= *h, m \R(k\h, m ) j -e- kXh '™ j \ 2 (A h 2p h x,<p h , m ) . 



First, we consider all summands with kXh.m < 1- As in the proof of [27, Theorem 
7.1], by applying (4.9) with q = 1 and (4.10), we get 

(4.16) 

i-i 

|iZ(AA h , ra )j - e~ fcAh - mi | = (i?(fcA fe>ro ) - e- kX ^)^2R(kX h , m y- l - i e- kX ^ 

i=0 

Therefore, since tj = jk and kXh, m < 1 it holds true that 

Xh, m \R(kX h , m y -e' kX '^\ 2 < C(jk)- 2 k 2 Xh, m {jkX h , m ) 4 e- 2c 3 kX ^e 2ckX ^ 

< CtJ 2 k, 

where we also used that sup z>0 z A e~ 2cz < oo. 

For all summands with kXh, m > 1 we get the estimate 

Xh, m \R(kX h , m y - e ~ kX >^\ 2 < 2k- 1 (kX h , m ) 2 (\R(kX h , m y\ 2 + |e- fcA ^'| 2 ). 

As it is shown in the proof of [27, Lemma 7.3], we have 

(4.17) \R(z)\< — - — , for all z > 1, with c > 0. 

1 + cz 

In fact, for the implicit Eulcr scheme this is immediately true with c = 1, but it 
also holds for all rational functions R{z), which satisfy (4.7). 
Together with kXh.m > 1 this yields 

\kX h . m R(kX h , m y\ 2 < ( * X i? ) 2 (l + cfcA fe , ro )- 2(j - 1) 

< JL(i + c)- 2 °- 1) = Ic-^- 1 ) 1 ^^) < cr 2 . 

As above we also have 

\kX h , m e- kXh ' mj \ 2 <Cj~ 2 . 
Therefore, also in the case kXh.m > 1, it holds that 

X h , m \R(kX h , m y - e- kX ^\ 2 < Ctfk. 
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Together with Parseval's identity and (3.8) we arrive at 

oo 

I? < Ctfk J2 (A;Jp h x, Vh , m ) 2 = CtJ 2 k\\A^P f 



x\\ 2 < cr 2 k\\x\\ 2 



The term T2 is covered by Lemma 4.1 (Hi) which gives 

T 2 = ||.F/,(*jH| < C^r 1 ||x||_ 1 < C/it- 1 ||a;||_ 1 . 

Finally, for T3 we apply Lemma 2.4 (i) with v = 1 and (ii) with v = i and get 

T 3 = \\AE(t)A-i(E(tj -t) -l)A~%x\\ < Ct~ l {tj — ||ar|j _ 1 < Gr l k^ \\x\\_ r 

Combining the estimates for T\. T2 and T3 proves (Hi) with p = 1. As usual, the 
intermediate cases follow by interpolation. □ 

We also have an analogue of Lemma 4.2. A time discrete version of Lemma 4.4 
(ii), where the integral is replaced by a sum, is shown in [29]. 

Lemma 4.4. Let < p < 1. Under Assumption 3.1 the operator F k h satisfies the 
following estimates. 

(i) There exists a constant C such that 
rt 

F kh (a)xda < C(h 2 ~ p + k^ E )\\x\\_ p 

for all x G H~ p , t > 0, and h,k G (0, 1] . 

(ii) Under the additional Assumption 3.2 there exists a constant C such that 

\\F kh (a)x\\ 2 da) 2 < C(h 1+P + k 1 ¥)\\x\\ 



for all x G ti~P, t > 0, and h,k G (0, 1]. 

Proof. The proof of (i) uses a similar technique as the proof of Lemma 4.3 (Hi). 
First, without loss of generality, we can assume that t — t n for some n > 0. In fact, 
if t n < t < t n+ \ then we have 



F kh (a)xda 



< 



F kh (a)xda 



F kh (a)xda 



F kh (t)xda 



For the second term we get by Lemma 4.3 (Hi) 
jf F kh (a)xdaj < (F kh (a)-F kh (t))xda 

(E(a) - E{t))xda + (t - t n )\\F kh (t)x\\ 

E(t n )A% E{a - t n )A~ixda + (t - t n )\\A% E(t)A~%x\\ 

+ C(t - t n ){h 2 - p + k^ E )t- 1 \\x\\_ p . 

We continue by applying Lemma 2.4 (Hi) and (i) with v = £ and the fact that 
(t — t„)£ _1 < 1 which yields 

F kh (a)xda < Cdt-tnf-^ +{t-t n )t-% +h 2 - p + k^ E )\\x\\_ 



< 



< 



C(h 2 - p + fcV)| 
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Next, we have 



f F kh (<j)xda < f (E kh (a) -E h {a))P h xda 
Jo Jo 



(E h (a)P h -E(a))xda 



For the second term Lemma 4.2 (i) yields the bound 



(E h (a)P h -E(a))xda 
Thus, it is enough to show that 

(E kh (a)-E h (a))P h xda 



<ch 2 -p\\ x \\- c 



< Ck~^\\x\ 



where the constant C = C(p) is independent of h, k <E (0, 1], t > 0, and x £ H p . 
We plug in the definition of E k h and obtain 

(4.18) 



(E kh (a) - E h (a))P h xda < { R ( kA h) j - E h {tj))P h xda 

n „t ■ 

+ \\J2 I {E h {t])-E h {a))P h xda 

As in (4.5) let (^h,m)m=i De the positive eigenvalues of Ah with corresponding 
orthonormal eigenvectors (</ 3 ?i,m) m !Li C Sh- Then, Parseval's identity yields for the 
first summand 

n ft ■ 

(R(kA h y -Ehit^Phxda 



N h n 

rn—l j — 1 

N h n 

^ E {^xl m \R(kX h , m ) j -e 

rn—l j — 1 



(P h X, (fh,r 



- A h . m t i 



(A h 2 PhX.iph^n)' 



As in the proof of Lemma 4.3 (Hi) we first study all summands with k\h,m < 1. In 
this case (4.16) gives 



J _ e -A?,,m*3 



, 2 o -c0-l)feA h| . 
/i.m / c 



p+4 " 

= CA^^e^'-fc 2 ^ j/ce- cjfeAh 



i=i 



< c V m fc / (^ + fc)c- cA — ff d<7 
Jo 

£±1/1 k 
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For all summands with kXhm > 1 we have the estimates 



Ah. rati 



3=1 



< k^J2 kX h,m(\ R ( kX h,m)\ j +e- kXh 



,ra3 



3=1 



' 3=1 3=1 

where we used (4.17) and c _feAh m ' :,_1 ^ < e - ^ -1 ) for kXh, m > 1- Altogether, this 
proves 



(4.19) II {R{kA h y -Ehit^Phxda 



<Ck 2 -rY,( A h^ p hX^h, m ) 2 = Ck 2 -p\\A^P h x\\ 2 . 

3 = 1 

In order to complete the proof of (i) it remains to find an estimate for the second 
term in (4.18). By applying Parseval's identity in the same way as above we get 



3 = 1 ^ 



2J / (Ehitj)- E h {a))P h x&a 

3 = 1 Jt ^ 



EE 

ro=l j'=l " 
7V h n 

E | A LE e-; 

m— 1 j=l 



Ah, m t, _ e -A h , m <r\ dcr 



-kX h 



da 



(A h 2 p h x,(ph, m y 



Since it holds that 







h <'"' T ) dcr = ke- kXh ' m - (1 - c - kXh -™) 



we have 



4 m E e " Ah ' rat ' -1 

3 = 1 



A? „, ( ke- kXh - m ~ 



-kXh 



-X h , m a\ 



\^ ( 



3 = 1 



k\ h , m (j-l) 



K^\k\ h , m e- k ^-(l-e- k ^)\*(l-e 



-kXh, m \ I 2 /-! _„-A:Afc, ra \- 2 



Further, if kXhm < 1 then it is true that 



kX h , m G~ kXh m - (1 - c- fcAh 



— 2ftAh I ^/c Aj, 



1-kXh 



< Cfc 4 A 



4 \4 



h.m ' 
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Thus, in this case we derive the estimate 



-Ai, ra <r"\ 



do- 



o 



< Cfc 2 ^(fcA fe , w ) p - - 2 < Ok 2 -", 

where we have used that the function x >-> x(l — e - *) -1 is bounded for all x £ (0, 1]. 
On the other hand, if kXh, m > 1 then we have 

ffe 



-kXh 



d(T 



-kX h 



1 - e 



-fcA», 



2 Vi- e -^, m )- 2 <c fe 2 -", 



since A£ < fc 2_p , sup a;>0 xe _a: < oo and (1 - c ~k\ htm y2 < ^ 



-i\-2 



Alto- 



gether, this yields 



3=1 



^ / (^(tj) - E h (a))l\x da < Ck 2 ~ p ^ (A h 2 P h x, <p hifn )' 



772 — 1 



which in combination with (4.19) and (3.8) completes the proof of (i) for p £ {0, 1}. 
The intermediate cases follow again by interpolation. 

As above we begin the proof of (ii) with the remark that without loss of generality 
we can assume that t = t n for some n > 0. In a similar way as in the proof of (i) 
we have 



LF fc fc(er)x da) < 



tn 



tn 



(E(a) - E(t))x\\ 2 da) 2 + (i - t n )i \\F kh (t)s 



For the second term Lemma 4.3 (i) with p, = 1 + p and v = p together with 
gives the desired estimate. The first summand is estimated by 
Lemma 2.4 (ii) which gives 



(E(a) - E(t))x\\ da) = / \\E{a)A~^{I - E(t - a))A*x\\ da 



Further, we have 

ft n 



\F kh (a)x\\ da) < 



ii 



< 



tn 



(t-a) p da) \\x\\ p <Ck- 



\{E kh (a) - E h (a))P h x\\ da 

fin 



\(E h (a)P h -E(a))x\\ da 
and Lemma 4.2 (ii) yields an estimate for the second summand of the form 
\{E h {a)P h - E{a))x\\ 2 daY < Ch 1+P \\x\\ p . 



o 

Thus it remains to show 







(E kh (a)-E h (a))P h x\\ z da) < Ck~ 
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As above, we prove this estimate for p £ {0, 1}. The intermediate cases follow again 
by interpolation. By the definition of E k h we obtain the analogue of (4.18), namely 



\(E kh (a) - E h (a))P h x\\ 2 da) 2 
(4.20) -(X/ \\(R(kA h y -E h (t 3 ))P h x\\ 2 da 



(E / da) 2 . 



For the square of the first summand we again apply Parseval's identity with re- 
spect to the orthonormal eigenbasis (<Ph,m)m=i c &h 01 ^/i with corresponding 
eigenvalues (\h, m )^ =1 and get 

]T / ||(i?(fcA) j -^fe))^H da 

n AT,, 

= E fc E VJ^w - e- fcAh - mj f(4 lW ^. vh,m) 2 - 

j — 1 m— 1 

For all summands with k\h, m < 1 we apply (4.16). This yields 
V / ||(fl(fcA fc y - ^(^))Ax|| 2 da 

,•=1^-1 

m— 1 7 — 1 

< C V fc 2 A^e 2cfeA ^ / (a + fc) 2 c- 2cA ^ CT da(^ P , Wm ) 2 
m=i ^ 



(4.21) 



< Cfc 1+p ||yl|Pha;|| 2 . 

For the remaining summands with k\h,m > 1 we use (4.17) and get the estimate 

n n 

J2 \R(k\ h , m ) j - e-^-^f < 2^ ((1 + c)- 2 ^ + (T 2 ') < C, 

3=1 3=1 

where the bound C is independent of n. Hence, also in this case we have 
/ \\{R(kA h y - Euit^Puxf da 

(4.22) 

V ' N h 

< C E k K P m{ A l p hX,Vh, m ) 2 < Ck 1+p \\Alp h xf. 

m—1 
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Next, we prove a similar result for the square of the second summand in (4.20). As 
in the proof of part (i) an application of Parscval's identity yields 

/ \\{E h (t 3 )-E h (a))P h x\\ 2 da 

n N h -tj 

= E E / \e- kXh ' mj - e-^\ 2 da(AtP h x,<p h , m ) 2 . 



j — 1 m— 1 

By using the fact 



/ | e -fcAh,mj _ e -A h . m a|2 dcr < e -2k\ h , m (j-l) k (i_ e -k\ h , m y 



we obtain 



]T / \\(E h {tj) - E h (a))P h x\\ 2 da 



iV fe n 

< E A^(i-e- fe ^) 2 (4^^, m ) 2 E e " 2feA ' , '"' (J '- 1) 

m— 1 J — 1 

< E ^*(l-e- fcAfc -) a (AJfl k x, PfclW ) a (l-e- a ^.-)- 1 . 
Since 1 - e _2fcAh ™ = (1 + e -fcAh '"»)(l - e ~ fcAh m ) we conclude 

E / II da 

N h 

< E V-.*( 1 - e "* Ah -")(^ , Vr.^) a ( 1 + e " fc ^)" 1 

m— 1 

< -o k E A r m (! - c- feA '-)(4p^,^, m ) 2 



2 

m— 1 



If p = this simplifies to ifc||Pfca;|| 2 since 1 — e fcAh m < 1. If p = 1 then we 

use 1 — c -fcAh > m < kXh.m and the right hand side is bounded by \k 2 \\ A£ Phx\\ 2 . 
Altogether this proves 

(4.23) Y \\(E h {t j )-E h {a))P h xf&a<Ck 1+ r\\AlP h x\\ 2 . 

Finally, a combination of (4.20) with (4.21), (4.22) and (4.23) gives 

(4.24) (y \\(E kh (a)~E h (a))P h x\\ 2 day < Ck^\\Afp h x\\ 

which completes the proof of the case p = 0. For the case p = 1 wc additionally 
use (3.6) and Assumption 3.2 which yields 

\\AlP h x\\ = \\PhX^ < C||z||i 

for all x G H 1 . This completes the proof of the Lemma. □ 
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5. Proof of Theorem 1.1 



The aim of this section is to prove the strong convergence of the spatially semidis- 
crete approximation (1.4) to the mild solution (1.2) of (1.1). 

The following two lemmas are useful for the proof of Theorem 1.1. The first 
lemma is a special version of [7, Lemma 7.2] and is needed to estimate the stochastic 
integrals. The second is a generalized version of Gronwall's lemma. A proof of this 
version can be found in [10] or in [16, Lemma 7.1.1]. 

Lemma 5.1. For any p G [2,oo), t G [0,T] and for any L\-valued predictable 
process $(0"), o~ G [0, t], we have 



E 



$(tr) dW(a) < C(p)E 



I Will? dCT 



Here the constant C (p) can be chosen to be 



C(p) 



;(p-i: 



V 



Lemma 5.2. Let the function ip: [0,T] 



p(f-i) 

p-l) 

be nonnegative and continuous. If 



(5.1) <p(t) =C X +C 2 \ (t- a)~ 1+0 ip(a) da 

Jo 

for some constants C\, C% > and ft > and for all t £ (0, T] , then there exists a 
constant C = C(C2,T, ft) such that 

f(t) < CCi, for all t g (0,T]. 

After these preparations we are ready to prove the first main theorem. 

Proof of Theorem 1.1. For t £ (0,T] we have by (1.2) and (1.4) 
(5.2) 



\X h (t)-X(t)\\ Lp(n;H) < \\F h (t)X 



+ 



ft 



E h {t-a)P h f(X h (a))da- / E(t - a)f(X(a)) da 



LP(fl;H) 



E h (t-a)P h g(X h (a))dW(a)- / E(t - a)g(X(a))dW(a) 

Jo LP(n;H) 

where Fh(t) = Eh(t)Ph — E(t). The first term is estimated by Lemma 4.1 (i) with 
fx = v = 1 + r, which yields 



(5.3) nh 1 +r\ 
The second term in (5.2) is dominated by three additional terms as follows 



\ F ^)M\ W) < Chl+r \\ A ^M W ) 



E h (t-a)P h f(X h (a))da- / E{t - a)f(X(a)) da 



LP{U;H) 



< 



E h (t-a)P h (f(X h (a)) - f(X(a))) da 



(E h (t - a)P h - E(t - a)) (f(X(a)) - f(X(t))) da 



Lp{n-H) 



(E h (t - a)P h - E(t - a))f(X(t)) da 
h +h + h- 



Lp(Q;H) 
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We estimate each term separately. First note that, by interpolation between (3.5) 
and (3.9), we have \\Eh(t)Phx\\ < Ct~~i~ ||x||_ 1+r . Together with Assumption 2.1 
this yields 

rt 



(5.4) 



h< / \\E h (t-a)P h (f(X h (a))-f{X{a)))\ 



<C / (t-a)-^\\X h (a)-X(a)\\. p 



LP(fl;H) 



LP(fl;H) 

do-. 



do- 



The term I2 is estimated by applying Lemma 4.1 (Hi) with p = 1 — r, Assumption 
2.1 and (2.4) with s = 0. Then we get 



h < 



\F h (t-a)(f(X(a))-f(X(t)))\ 



lp(u-h) 



da 



(5.5) 



< Ch 1+r j (t - -)- 1 ||X(a) - X(t)\\ LP{n . H) da 

< Ch 1+r [ (t - o-)- 1+ 2 da < Ch 1+r . 

Jo 



Finally, the estimate for I3 is a straightforward application of Lemma 4.2 (i) p 
1 — r. A further application of Assumption 2.1 and (2.3) gives 

< Ch 1+r . 



(5.6) h<Ch 1+r \\f(X(t))\\ <Ch 1+r 1+ sup \\X(a)\\ H . 

v <tG[0,T] v ' ' 

The right hand side of this estimate is finite in view of (2.3). A combination of the 
estimates (5.4), (5.5), and (5.6) yields 



(5.7) 



E h (t-a)P h f(X h (a))da- / E(t — a)f(X(a)) da 



Lp(n-H) 



<Ch' +r + C (t - a)^\\X h (a) - X(a)\\ da 



Next, we estimate the norm of the stochastic integral in (5.2). First, we apply 
Lemma 5.1 and get 



Lp(VL;H) 



E h {t - a)P h g{X h (a)) dW(a) E(t - a)g{X{a)) dW{a) 

Jo 

< C(E [( J \\E h (t - a)P h g{X h {a)) - E(t - a)g(X(a))\\l„ da) ' 
The right hand side is a norm. Hence, the triangle inequality gives 
(e[(^ \\E h (t - a)P h g(X h (a)) - E(t - a)g{X{a))f L% da 
\E h (t - a)P h (g(X h (a)) - g(X(a))) |£ g da 

1 

\F h (t-a)(g(X(a))-g(X(t)))\\ 2 L0 da 



LP(f2;I 
Lp(Q;R) 



F h (t-a)g(X(t)) L0 da 

h + I 5 +I 6 . 
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In a similar way as for I\, we find an estimate for I 4 by using the stability of the 
operator Eh(t)Ph, that is, (3.5) with p = 0. Together with Assumption 2.2 we get 

ft 

h<C 



(5. 



= C 
< C 



\X h (a) - X(a)\\ da 

f \\X h (a)-X(a)fda 
Jo 



\X h (a)-X(a)\ 



Lp(n-H] 



LP/ 2 (Q;R 

6a) 



For the estimate of term J5 we apply Lemma 4.1 (i) with fx = 1 + r and v = 0, 
which gives < C/i 14 -^-^ 1 . Additionally, we use the fact that ||LA/||^o < 

11-^11 ll-^IU f° r an y bounded linear operator L: H — > and Hilbert-Schmidt oper- 
ator M £ LP,. The estimate is completed by making use of Assumption 2.2 and the 
Holder-continuity (2.4) with s = 0. Altogether, we derive 

ft 



h < Ch 



l+r 



(t-a) 



\X(a) - X{t) da 



< CTi 1 



/ (*-^)- W H^)-^(*)HL(n ; H)^)' 



<Ch 1+r ( / (t-cr)- r d(j) 2 <C7l 



,1+r 



(5.9) 



Note that in the last step the generic constant C depends on r e [0, 1) and blows 
up as r — > 1. 

Finally, for Iq, let (ip m )m>i denote an arbitrary orthonormal basis of the Hilbcrt 
space Uq. Then, by using Lemma 4.2 (ii) with p = r, Assumption 2.2 and (2.3), 
we get 

J 6= (£ / ||^(t-^W))^m|| da 



,-1 Jo 



LP(n;I 



(5.10) 



2\ 2 



m— 1 

= Ch 1+r \\\\g(X(t))\\ L0 

II " "^.r LP(fi 

<Ch 1+r (l+ sup |UiX(a)|L <C/i 



l+r 



<t£[0,T] 

In total, we have by (5.8), (5.9), and (5.10) that 



(5.11) 



E h (t - a)P h <?(X h (a)) dW» / E(t - a)g{X{a)) dW{a) 



lp{q,-h) 



< Ch 1+r + C 



\X h (a) - X(a)\\ LP(n . H) 



da 



Coming back to (5.2), by (5.3), (5.7), and (5.11) we conclude that 

\\X h (t) - X{t)\\l P(n . H) < Ch 2 ^ + C [ \\X h (a) - X{a)\\\ p{n . H) da 



+ C{ / (t-a)~=- X h (a)-X(a) L 



da 
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Finally, we note that 

\X h (a) - X{a)\\ 2 LP{n . H) da 
j-t 

= / (t-<r)^(t-<r)-^\\X h (a)-X(<T)\\l PiUH) d<T 
Jo 

< j\t - o)-^\\X h {a) - X{<j)f LP{n . H) da, 



o 

and by Holder's inequality 



= J o (t - <?r^ (t - o-y^WXhia) - X(a)\\ LP{n . H) da 

(t - o)- 1 -^ \\X h (a) - X(a)f LW) da 



o v 1 - rt 



< T— 

- \r + 1 

ll 1 1 2 

Hence, for ip(t) = \\Xh{t) — X(t)\\ LI1 , n . H , we have shown that 

p-t 

<p(t) < C7i 2(1+r) +C (t- a^ipia) da 
Jo 

and Lemma 5.2 completes the proof. □ 



6. Proof of Theorem 1.2 

This section is devoted to the proof of Theorem 1.2. As in Section 5 the proof 
relies on a discrete version of Gronwall's Lemma. Here we use a variant from [10, 
Lemma 7.1]. 

Lemma 6.1. Let C\,C-z > and let (tpj)j=x,...,N k be a nonnegative sequence. If 
for [3 £ (0, 1] we have 

j'-i 

(6.1) <pj < d + C 2 k tj-t^i for allj = l,...,N k , 

i=l 

then there exists a constant C = C(C2,T, f3) such that 

<Pj<Cd for all j = l,... ,N k . 
In particular, the constant C does not depend on k. 

Proof of Theorem 1.2. In terms of the rational function R{kAh), which was intro- 
duced in (4.5), we derive the following discrete variation of constants formula for 

K 

(6.2) 

j'-i i-i 
X{ = R(kA h yP h X -kJ2R(kA h y-*P h f(X l h ) +^2R{kA h y- i P h g(Xi)AW i+1 . 

i=0 i=0 
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By applying (1.2) and (6.2) we get for the error 
(6.3) 

\\X{-X(t 3 )\\ Lv( ^ H) < \\R(kA h yp h x -E(t 3 )x \\ LP{n . H) 

+ \kY^R{kA h y- l P h f(X l h ) - f 3 E(tj - a)f{X{a))da 

1 .,-n JO 



i=0 
3-1 



Lp(Q;H) 



+ II R{kA h y- l P h g{Xl)^W l+1 - [ 1 E(t 3 - a)g{X{a)) dW(a) 

" 4=0 J ° 



The first summand is the error of the fully discrete approximation scheme (4.3) for 
the homogeneous equation (4.1) but with the initial value being a random variable. 
By Assumption 2.3 we have that Xo(uj) 6 H 1+r for P-almost all lj € CI. Thus, the 
error estimate from [27, Theorem 7.8] yields 

\\R(kA h yP h X - *%)Xo|| LP(n . ff) < C(h 1+r + k^WA^XoW^^. 
For the other two summands we introduce an auxiliary process which is given by 
X kh (t):=Xl-\ ift G (tj-utj], j = l,...,N k , 
X kh (0) := P h X . 

By this definition Xkh is an adapted and left-continuous process and, therefore, 
predictable [25, p. 27]. Recalling the definition (4.11) of the family of operators 
Ekh(t), t > 0, we obtain 



(6.4) 



(6.5) 



J-l 3-1 f t i+1 

kJ2R(kA h y- l P h f(X l h ) = J2 / E kh(tj - a)P h f(X kh (a)) da 



i=0 



i = Q 

ft; 



E kh (t 3 -a)P h f(X kh (a))da 



and, analogously, 

(6.6) ^RikAhy^PhgiXDAW^ 1 = / E kh (t 3 - a)P h g(X kh {a)) dW{a). 



i=0 



By applying (6.5) we have the following estimate of the second summand in (6.3) 



E(t 3 -a)f(X(a))da 



3-1 

\kY,R(kA h y-*p h f(xi)- 

i=0 

(E kh (t 3 - a)P h f{X kh (a)) - E{t 3 - a)f(X(a))) da 



Lp(n-H) 



Lp(U:H) 



< 



E kh {t 3 - a)P h (f(X kh (a)) - f(X(a))) da 



Lp(n-H) 



+ 



(E kh (t 3 - a)P h - E(t 3 - a)) (f(X(a)) - f{X{t 3 ))) da 



lp(q-h) 



(E kh (t 3 - a)P h - E(t 3 - a))f(X(t 3 )) da 
=: Ji + J 2 + J 3 - 



Lp(Q;H) 



Note, that the terms Ji, J2, and J3 are of the same structure as the terms Ix, I2, 
and I3 in the proof of Theorem 1.1. Since with Lemmas 4.3 and 4.4 we have the 
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time discrete analogues of Lemmas 4.1 and 4.2 at our disposal the proof follows the 
same path. 

For the term Ji we first note that an interpolation between (4.8) and (4.12) yields 
\\E kh (t)P h x\\ < C^^ r ||x||_ 1+r for all t G [U-uU) and all x G H- 1+r . Together 
with Assumption 2.1 this gives 



Ji < 



\E kh (tj - a)P h (f(X kh (a)) - f(X(a))) \\ LP 



o 

■J'- 1 rti+i 



da 



i=0 
3-1 

< CkJ^t 

i=0 
3-1 



t ]-i \\ X h ~ X ( a )\\LP(n;H) 
\\ X h - X ( t i)\\ L P(n;H) 



da 



i=0 



\X(U) - X(a)\\ LP{n , H) 



da. 



By the Holder continuity (2.4) we get 



j-i 

■i i_ r 

Ev~ 

i=0 



\ X (ti)- x (<r)\\ LP(Q . !H) da 



3-1 

i=0 



3-1 



(a-t i )ida=-CkiJ2 t 3-^ 



< Ck? / a~— da < Ck*. 



Altogether, Ji is estimated by 



3-1 



(6.7) 



J^Cki+Ok^tj-I ll^-^(^)|| LP m ; m- 



i=0 



For the estimate of J2 we first note that F k h(t) = E k h(t)Ph — E(t) and, hence, we 
can apply Lemma 4.3 (Hi) with p = 1 — r. In the same way as in (5.5) we obtain 

(6.8) J 2 < C(h 1+r + k k $ L ). 

Likewise, but this time by an application of Lemma 4.4 (i) with p = 1 — r, we 
proceed with the term J3 in same way as in (5.6). By also using (2.3) this yields 



•9) J 3 <C(h 1+r + k 1 ^ L )(l+ sup \\X(a) 



re[0,T] 



\LP{Q,:H) 



< 



C(h l+r + k-^). 



It remains to estimate the third summand in (6.3) which contains the stochastic 
integrals. With (6.6) and Lemma 5.1 we get 



Vfi(fc4)'-'A 9 (^)Ar +1 - f J E{t, J -a)g{X{a))dW{a) 

i-n Jo 



LP(fl;H) 



E kh {t, - a)P h g{X kh {a)) dW{a) - / E(tj - a)g(X(a)) dW{a) 



Lp(Q;H) 



< C E 



\E kh (tj - a)P h g{X kh {a)) - E{t, - a)g{X{a))\\ T0 da 
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In the last step Lemma 5.1 is applicable since by our definitions (4.11) and (6.4) the 
process [0, tj] 3 a i-» E k h(tj — v)Phg{X k h(a)) & L° is adapted and left-continuous 
and, therefore, predictable. 

Next, we use the triangle inequality and obtain 



E 



\E kh {t 3 - a)P h g{X kh {a)) - E{t 3 - a)g(X(a))\\ . da 



< 



\E kh {tj - a)P h (g(X kh (a)) - g(X(a))) \\ 3 da 



\F kh (tj - a){g(X{a)) - g(X(tj))) ||* da 



LP(Q;t 
LP(fi;R) 



\\F kh (t J -a)g(X(t J ))\\ L „da 
=: J4 + J5 + Je- 



LP(0;I 



For the estimate of J4 we use the facts that \\E k h{t)Phx\\ < C\\x\\ for all x £ H 
as well as || LM \\ L a < ||i|| ||iW|| for all linear bounded operators L: H — »• H and 
M £ L\. Together with Assumption 2.2 and the same technique as in (5.8) we get 



J 4 < C 



\X kh {a) - X{a)\\ da 



LP(0;] 



<C( \\X kh {a)-X{a)\\ LP{n . H) da 



= c(kj2\\xi-x(t i )\\ 2 LP(a . H) 

i=0 



i=0 



X(U)-X{a)\\ LP{n . H) da 



i=0 



By the Holder continuity (2.4) it holds that 

\\X(ti)-X(a)\\ 2 LP{a . H) da<Cj2 / {a-t t )da<Ck. 

i=0 Jt > 
Hence, we have 

(6.10) 



,h < ck^ + c(kJ2 \\Xh - x(u)\\ 2 LHn . H 



i=0 



In the way same as in (5.9), we apply Lemma 4.3 (i) with fi = 1 + r and v = and 
we derive the estimate 



.11) 



J 5 <C(h 1+r + k^), 



where we also used Assumption 2.2 and (2.4). By (2.3) and the same arguments 
which gave (5.10) we get 



.12) 



J 6 < C(h 1+r + k 1 ^ 
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To summarize our estimate of (6.3), by (6.7) to (6.12) we have shown that 

\\ x l - *(*;)llL.(n,io ^ c ( hl+r + kh + 

3-1 1-r 

+ C ( k ^2 t j-i \\ X h~ X ( t i)\\ L p(n;H) 
i=0 

3-1 

+ Ck22\\Xl - X(ti)\\ LP(n . H y 
i=0 

Further, we have 

*E IW - X ^)llL(n ;ff ) < \\ x t X ^)\\Un-.HY 

i=0 i=0 

and by Holder's inequality 
J'- 1 _ i=r 

^ (*£*7-? E )*(*£*7-? E ll jr A- Jc: (*')llL(n^) 

i=0 i=0 



2 



3-1 



i=0 



2 

n;J?) 



Hence, by setting = — X(tj)\\ Lp ,^ H . we have proven that 

3-1 

<Pi < C(h 1+r + + k^f + Ck^tJZTtpi. 

i=0 

An application of Lemma 6.1 completes the proof of the theorem. □ 

7. Additive noise 

In this section we focus on stochastic partial differential equations with additive 
noise, that is, the L^-valued function g does not depend on X. Thus, the SPDE 
(1.1) has the form 

dX(t) + [AX{t) + f(X(t))] dt = g dW(t), for < t < T, 

(7,1) X(0) = X o . 

Numerical schemes for the approximation of SPDEs with additive noise have 
been extensively studied. For example, for schemes which involve the finite element 
method we refer to [14, 21, 28] and the references therein. 

For additive noise Assumption 2.2 simplifies to 

Assumption 7.1. There exists r G [0,1] such that the HUbert- Schmidt operator 
g G L\ satisfies 

IMUg, r < °°- 

Recall that in the case, where U = H and g : H — > H is the identity, Assumption 
7.1 reads as follows 

oo 

ll.9!lL«. r = ll^.9llL« = Ell A§ Q^™l| 2<0 °' 

rn—1 
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where (ip m )m>i denotes an arbitrary orthonormal basis of H . In particular, if r = 
we have ||g||j,o = Tr(Q) < oo which is a common assumption on the covariancc 
operator Q (see [7, 25]). 

Under Assumptions 2.1, 2.3 and 7.1 with r £ [0, 1], p £ [2, oo) there exists a mild 
solution X: [0,T] x!!^ H 1+r to (7.1) (see [7] and [22, Corollary 5.2]). 

In particular, we stress the fact that the parameter value r = 1 is included for 
SPDEs with additive noise. As the next corollary shows the same is true for the 
error estimates of the numerical approximations (1.4) and (1.5). 

Corollary 7.2. Let Assumptions 2.1, 2.3 and 7.1 hold for some r £ [0,1], p £ 
[2, oo). Let X denote the mild solution to (7.1). 

(i) Under Assumption 3.1 there exists a constant C, independent of h £ (0,1], 
such that 

\\X h (t) - X(t)\\ LP{Q . H) < Ch 1+r , for all t £ (0,T], 

where Xh is the corresponding spatially semidiscrete approximation (1.4). 

(ii) Under Assumptions 3.1 and 3.2 there exists a constant C, independent of 
k,h £ (0, 1], such that 

IN - *(*j)IL (ni jj) ^ c ^ 1+r + fc *)' f° r aU i = • • • ' N >» 

where X J h is the corresponding spatio-temporal discrete approximation (1.5). 

Proof. For (i) and (ii) the assertion follows directly from Theorems 1.1 and 1.2 for 
all parameter values r £ [0, 1). 

A close inspection of the proof of Theorem 1.1 shows that the condition r < 1 
is only required in the estimate (5.9). But in the case of additive noise the term L 5 
is equal to 0. Hence, the convergence result holds true by the same arguments for 
r = 1. 

The same is true for (ii) where the condition r < 1 only shows up in the estimate 
of (6.11). □ 

We remark that a similar convergence result to (i) can be found in [21, Prop. 2.3]. 
But there the authors had to incorporate a singularity of the form max(0, log(^)) 
which is now removed. 
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